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Speech  clipping  and  filtering  circuits  in  vpice  communica¬ 
tion  systems  so  distort  the  transmitted  speech  as  to  raise  a 
question  of  their  effect  on  intelligibility.  Results  of  several 
experiments  indicate  that  speech  clipping  alone  improves 
intelligibility  when  signals  are  partially  masked  by  electrical 
noise  entering  the  system  between  transmitter  andreceiver. 
Signals  which  are  not  so  masked  are  similarly  affected  to  al¬ 
most  the  came  degree.  Intelligibility  improves  as  clipping 
increases  up  to  at  least  24  db.  Under  extreme  noise  mask¬ 
ing,  very  heavy  clipping  (100  db  or  more)  improves  intelligi¬ 
bility,  but  reduces  intelligibility  of  signals  not  so  masked. 
Other  experiments  show  that  intelligibility  is  not  likely  to 
suffer  when  voice  frequencies  below  580  cycles  are  sharply 
attenuated*  with  the  possible  exception  that  when  masking 
white  noise  is  very  intense,  restoring  low  frequencies  caused 
a  slight  increase  of  intelligibility.  Attenuating  frequencies 
above  3900  cycles  has  no  significant  effect  on  intelligibility; 
however,  reducing  the  low-pass  cut-off  point  to  2500  cycles 
doed  lower  intelligibility  considerably.  It  is  suggested  that 
band-pass  filtering  limits  be  600  to  4000  cycles*  A  list  of 
references  is  appended* 
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PREMODULATION  SPEECH  CLIPPING  AND  FILTERING: 
A  CONSIDERATION  OF  THEIR  EFFECTS  ON  THE 
INTELLIGIBILITY  OF  SPEECH 


Over  the  past  ten  years,  the -radio  journals  have  pub¬ 
lished  extensively  on  amplitude-  and  frequency-limiting 
speech  amplifiers.  The  fundamental  purpose  of  such  equip¬ 
ment  has  been  the  improvement  of  transmission  efficiency 
in  radiophone  transmitters,  by  permitting  only  the  frequen¬ 
cies  essential  to  speech  to  modulate  the  transmitter*  and 
by  maintaining,  the  highest  average  proportion  of  modulation 
possible  without  exceeding  the  transmitter’s  linear  capability^ 
The  principal  devices  recommended  have  used  volume  com¬ 
pressors,  limiters,  clippers,  or  automatic  gain-control  net¬ 
works,  and  usually  incorporate  frequency  filters  which  elim¬ 
inate  all  but  the  band  occupied  by  typical  speech. 
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The- physics  and  economics  of  these  -amplitude-  and  fre¬ 
quency-restricting  amplifiers  have  apparently  been  generally 
accepted,  However,  there  seems  to  have  been  a  good  deal  of 
Unwillingness  to  employ  them,  regardless  of  their  technical 
advantages,  on  the  ground  that  they  so  distort  the  natural 
voice  as  to  jeopardize  intelligibility.  This  argument  does  not 
square  with  the  results  of  some  recent  experiments  designed 
to  study  exactly  this  problem:  how  do  clipping  and  filter¬ 
ing  affect  speech  intelligibility?  Indicative  of  this  work. 
Was  an  article  published  over  two  years  ago  in  (^ST  ,  Sev¬ 
eral  others  have  appeared  in  the  engineering  and  scientific 
literature  which  are  worth  the  amateur  phone  man’s  consid¬ 
eration,  This  paper  will  Undertake  a  review  of  these  articles, 
and  attempt  to  demonstrate  the  usefulness  of  their  results. 
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First,  let  us  refresh  our  knowledge  of  the  dynamics  of 
speech,  .When  the  human  voice  is  impressed  upon  a  micro¬ 
phone,  voltages  are- setup-whose  instantaneous  peaks  normal¬ 
ly  exceed  the  root-mean-square  value  by  12  to  15  db  ,  It  is 
this  "peak  factor"  which  requires  us  to  design  our  ampli¬ 
fiers  with  a  much  greater  range  of  linear  amplification 
than  we  expect  to  use  most  of  the  time;  when  we  wish  to 
transmit  voice  signals  with  minimum  distortion.  Also,  in' 
English  as  we  speak  it,  the  average  vowel  produces  a  peak 
voltage  which  runs  about  12  db  higher  than  that  produced  by 
the  average  consonant3*4,  This  12  db  figure  is  the  .average 
V  vowel-consonant  ratio  for  all  combinations  of  sounds  in  our 
language!  the  instantaneous  Value  may  vary  from  a  fraction 
of  the  average 'to  several  times  its  magnitude,  Nbw,  peculiar- 
ly  enough,  the  intelligibility  of  speech  depends  much, more 
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heavily  Upon  the  sounding  of  consonants  (b,  p,  z,  s,  t,  dj  f,  v, 
th,  k,  If  mi  n,  etc.)  than  upon  vowels  (a,  e»  i,  o,  u,  y,  etc.), 
despite  the  fact  thst  the  ordinary  vowel  sound  has  around  16 
times  the  power  of  the  usual  consonant. 

Effects  of  Clipping*  on  Intelligibility 


From  the  foregoing,  we  can  immediately  see  what  happens 
when  the  peaks  are  clipped  from  the  speech  wave,  ./t  one  and 
the  same  time  we  reduce  (l)  the  peak  factor ,  and  (2)thev0wel- 
to-consonant  ratio.  Effectively,  wc  have  cut  down  the  range 
of  variation  in  speech-energy  amplitude,  and  in  so  doing  have 
given  proportionally  greater  emphasis  to  consonants,  upon 
which  intelligibility  largely  depends,  as  wc  have  seen,  This 
would  lead  us  to  expect  that  we  might  improve  intelligibility 
by  the  use  of  clipping.  There  is,  on  the  other  hand,  the  pos¬ 
sibility  that  the  distortion  of  amplitudes  resulting  from  peak¬ 
clipping  might  actually  reduce  intelligibility,  This  is  the  gist 
of  the  question  for  which  answer  s  have  been  sought  in  the  psy¬ 
chological  laboratory,  using  some  tcchni  ]ues  of  measurement 
which  have  become  standard  in  studying  voice  communication. 
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Some  years  ago  the  Bell  Telephone  Laboratories  devised 
tests  to  measure  the  effects  of  telephone  circuits  on  the  intel¬ 
ligibility  of  speech.  A  talker  would  read  lists  of  syllables  or 
Words  made  up  of  all  the  sounds  of  the  English  language, in 
Various  combinations.  His  voice  was  then  transmitted  over 
a  telephone  circuit  to  a  group  of  listeners  who  would  write 
down  what  they  thought  the  talker  had  said.  By  comparing 
the  talker’s  original  list  with  the  listeners'  reproduction  of 
it,  a  percentage  could  be  Computed  representing  the  propor¬ 
tion  of  spoken  sounds  correctly  received  by  the  listeners  , 
as  circuit  conditions  were  systematically  changed  by  intro¬ 
ducing  various  degrees  Of  filtering,  attenuation,  nonlinear 
amplification  and  thelike.  Early  in  WorldWar  11,  this  meth¬ 
od  was  applied  by  a  group  of  psychologists  at  the  Harvard 
University  Psycho- Acoustic  Laboratory,  to  a  study  of  the 
effects  of  ptcrpodulatioh  clipping  upon  the  intelligibility  of 
speech  transmitted  over  a  miniature  radiophone  circuit.  Us¬ 
ing  standard  amplitude  modulation4,  Tjhe  results  of  this 
testing  show  that  in  the  absence  of  QRN  »  when  extremely 
Weak  undipped  signals  Were  ohly  about  30%  intelligible, 


^In  this  discussion  as  in  general  engineering  usage,  '‘clip¬ 
ping*  *  means  cutting  off  both  positive  and  negative  speech- 
wave  peaks  to  an  equal  extent,  then  reamplifying  the  result¬ 
ant  up  to  the  magnitude  it  would  ha^e  attained  without  clip¬ 
ping.  : 


+lri  the  international  code  of  tclcgfaph  operating  signals,  the 
;  letter  group  CRN  is  Used  in  reference  to  the  noises  produced 
by  atmospheric  static,  it  is  used  th  this  sense  throughout  .}' 
this  paper  as  a  matter  of  convenience,  v  ;  _  jj 
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using  24  db  of  clipping  would  raise  intelligibility  to  75%, 
These  percentages  represent  intelligibility  of  words  on  the 
special  lists;  the  equivalent  in  connected  meaningful  sen¬ 
tences  may  be  higher  '  ,  Such  an  advantage  in  favor  of 
clipping  also  holds  when  Q.RN  is  very  heavy,  to  almost  the 
same  degree.  Listeners  In  these  experiments  were  atsked  to 
report  on  change  of  voice  quality  as  clipping  increased. 
Here  is  tlieir  average  opinion:  at  0  db  clipping,  natural  voice; 
6  db,  clipping  effects  barely  noticeable  (comparable  to  stand¬ 
ard  broadcast  quality);  12  db,  talker  appeared  to  be  enunciat¬ 
ing  with  unusual  care;  18  db,  voice  took  on  a  sharp  “sandy" 
character,  quality  rated  not  as  good  as  before;  24  db,  voice, 
was  coarse  and  “grainy",  rated  as  poor.  Note,  however,- 
that  despite  the  very  evident  changes  in  voice-quality,  in¬ 
telligibility  actually  improved,  particularly  when  conditions 
were  less  than  optimal.  This  effect  has  been  noted  before*, 
although  perhaps  not  s6  explicitly  documented. 

The  question  arises,  .  “What  about  the  effects  of  clipping 
on  intelligibility  when  conditions  are  nearly  perfect?"  The 
most  definitive  answer  available  comes  from  another  series 
of  experiments6.  With  0  db  clipping,  signals  were  100% 
ihtelligible;  as  clipping  was  increased,  intelligibility  felh 
0.ff  .< slightly  until  at  20  db  clipping  it  had  reached  96%.  Clip¬ 
ping  was  gradually  advanced,  and  at  100  db  (almost  all  speech 
peaks  flattened  to  rectangles),  intelligibility  had  fallen  to 
75%^  (This,  remember.  Under  ideal  conditions  of  quiet  for 
both  talker  and  listeners,  with  no  fading  or  interference.) 
Incidentally, these  experiments  revealed  that  with  signaVhoise 
conditions  which  completely  obscured  unclipped  speech  (in¬ 
telligibility  at  0%  to  10%  ,  the  same  signal  when  clipped  lOOdb 
and  over  was  30%  intelligible.  In  these  experiments,  nothing 
was  said  about  changes  in  quality;  it  couldbe  expected,  how¬ 
ever,  that  with  such  severe  clipping  as  100  db  it  Would  be 
Very  hard  to  identify  the  talker  by  the  distinctive  sound  of 
hU  voice,  ; 


;;j' :  Effects  of  Filtering  on  intelligibility 


Up  to  this  point,  we  haVe  didcUssed  speech  ih  terms  of 
its  gross  amplitudes  only,  without  considering  the  individual 
frequencies  present  in  spoken  language.  For  reasons  dic¬ 
tated  by  .engineering  standards,  several  tecefat  amplifier  de¬ 
signs  have  included  both  high-  and  low-pass  filtering? 

Siriee  thi£  practice  isbeedming  more  widespread,  let  Us  ex¬ 
amine  itb  effect  on  intelligibility, 
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This  matter  has  also  been  Investigated  In  the  psycholog¬ 
ical  laboratory,. under  conditions  comparable  to  those  found 
in  the  amateur  phone  bands10*11.  Using  the  same  testing  pro¬ 
cedures  as  in  the  study  of  clipping  effects,  a  talker’s  voice 
was  transmitted  over  a  wire  circuit  to  a  group  of  listeners. 
The  speech  was  aubjoctedto  various  degrees  of  filtering  and 
attenuation,  and  was  then  combined  with  an  unflltered,  con¬ 
stant-intensity  thermal  noiso,  simulating  QlRN,  and  led  to 
the  listeners'  headphones,  .At  no  time  was  peak-clipping 
permitted  to  occur;  thus  the  effects  of  filtering  alone  could 
be  evaluated.  One  series  of  experiments,  studying  changes 
of  intelligibility  at  various  signal  levels  and  signal  noise 
ratios  when  either  high  or  low  frequencies  were  separately 
filtered  out10,  showed  that  when  everything  below  350  cycles 
was  cut  off,  intelligibility  of  moderately  to  very  strong  sig¬ 
nals  was  slightly  improved  by  comparison  with  unfiltered 
signals.  However,  at  the  lower  signal  levels,  where  t’jRN 
presumably  was  more  disturbing,  intelligibility  suffered1 
loss  as  a  result  of  such  filtering.  Extremely  weak  signals 
in  noise  were  5%  intelligible  when  the  350-cycle  high -pass 
filter  was  In  the  circuit,  but  jumped  to  25%  when  the  filter 
was  switched  out,  although  signal  strength  and  noise  level  re¬ 
mained  unchanged.  It  was  further  found  that  when  signals 
were  strong  and  in  the  clear,  everything  up  to  580  cycles- 
could  be  cut  off  with  little  damage  to  Intelligibility.  .As  to  cut¬ 
ting  off  the  highs,  when  everything  above  3950  cycles  was  e~ 
liminated  there  followed  very  little  reduction  of  intelligibility 
regardless  of  signal  strength.  However,  when  the  cutoff  point 
was  moved  down  to  2500  cycles,  results  were  quite  different. 
Whensignals  were  strong  and  clear,  intelligibility  was  down 
to  78%  with  the  filter  In,  as  compared  to  90%  with  no  filter-.* 
As  signals  grew  weaker,  the  proportional  loss  of  Intelligibil¬ 
ity  due  to  filtering  diminished  somewhat,  although  even  at  the 
lowest  signal  level  used  In  the  tests,  the  2500-cycle  low-pass 
filter  hampered  Intelligibility  appreciably. 


We  may  now  ask,  "What  happens  to  intelligibility  when 
we  filter  off  both  highs  and  lows  at  the  same  time?"  The  ef¬ 
fects  Of  band-pass  filtering  on  speech  in  a  noise  background 
have  been  separately  investigated11.  As  before,  unflltered 
constant-intensity  noise  was  superimposed  upon  the  filtered 
speech  signal,  which  was  also  varied  in.  strength -to  secure 
various  slgnal/nolse  ratios-*  As  might  be  expected  from  the 
discussion  of  high-  and  low-pass  filtering,  greatest  intelligi¬ 
bility  at  all  signal  atrehgths  resulted  when  the  widest  pass- 
band  was  used  (130-9200  cycles,  intelligibility  about  90%), 


The  effects  of  filtering  upon  intelligibility  were  most  notice¬ 
able,  as  before,  when  signals  were  strong  and  relative  noise- 
level  was  low.  Interestingly  enough,  atall  signal-levels,  the 
pass-bands  340-3900  cycles  and  550-3900  cycles  produced 
almost  identical  effects  on  intelligibility;  actually,  neither 
one  seriously  impaired  intelligibility  when- compared  with 
the  widest  pass-band.  However,  shitting  the  cutoff  points 
'lowarcTeach  other  clearly  resulted  in  poorer  intelligibility, 
as  the  following  table  shows.  Signal-strength  and  signal  noise 
ratio  are  the  same  for  all  filter  combinations. 


Fass-band  limits 

Intelligibility 

130-9200  cycles 

90% 

340-3900 

80 

550-3900 

80 

550-2500 

70 

870-3900 

65 

870-2500 

55 

There  have  been  several- other  investigations  of  filtering 
and  its  effect  upon  speech  intelligibility’*  .  However,  they 
deal  only  with  signals  transmitted  under  ideally  quiet, 
unfading  conditions  ■  for  both  talker  and  listeners  ;  theii*  re¬ 
sults,  therefore,  are  probably  not  as  pertinent  to  atnateur 
phone  operation  as  the  work  already  discussed, hencC  node- 
.  tailed  mention  of  them  has  been  made,  - 


I  •.  y  Summary 

Cn  the  basis  of  the  foregoing  evidence,  we  may  summar¬ 
ize  as  follows; 


1.  Speech-clipping  definitely  improves  intelligibility , 

2,  As  signals  get  weaker,  and  as  signal/noise  ratio  gets 
worse,  the  greater  the  clipping,  the  greater  the  improvement 
of  Intelligibility,  up  to  at  least  24  db  of  clipping. 

3.  Extremely  heavy  Clipping  (100  db  or  more)  is  bene¬ 

ficial  Urider  Very  severe  signal /noise  conditions,  although 
it  will  probably  hot  make  poor  signals  completely  understand¬ 
able,  1  • 

4,  Although  the  quality  of  speech  changes  noticeably 
oVer  the  clipping  range  from  0  db  to  24  db  (and  probably 
above)*  ijveri  undef  the  beet  signel  conditions  intelligibility 
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is  not  impaired  by  clipping. 

5.  lit  general,  high-pass  filtering  up  to  350  cycles  will 
not  harm  intelligibility,  and  may  actually  make  a  slight  im¬ 
provement  when  signals  are  strong  and  clear.  . 


6.  Under  optimum  signal  conditions,  frequencies  below 
580  cycles  maybe  eliminated  with  little  loss  of  intelligibility. 

7*  Cuttingoff  frequencies  above  3900  cycles  byUse  of  a 
loW-pasS.  filter  will  have  hardly  any  effect  on  intelligibility. 

8.  Cutting  off  frequencies  above  2500  cycles  will  ser¬ 
iously  impair  intelligibility. 

■  ,  Conclusions 


We  may  conclude,  therefore,  that  the  engineering  ad* 
Vantages  obtained  from  speech  clipping  prior  to  modulation 
are  accompanied  by  definite  improvement  of  intelligibility 
at  the  receiving  end  of  a  radio  circuit;  especially  under  ad¬ 
verse  operating  conditions.  Further,  the  change  in  voice- 
quality  noted  as  a  by-product  :of  clipping  does  not  re&lly  irri- 
pair  intelligibility  of  the  signal;  speech  canbc  distorted  very 
severly  by  nonlinear  transmission  and  still  be  perfectly  un¬ 
derstandable.  Filtering  to  avoid  or  remove  the  undesirable 
side-effects  of  clipping  will  not  impair  the  intelligibility  of 
speech  until  the  upper  cutoff  frequency  gets  down  around 
2500  cycles.  In  fact,  filtering  off  the  low  frequencies  (below 
350)  may  actually  improve  intelligibility  under  good  signal 
conditions.  The  limit  for  cutting  off  low  frequencies  is  ap* 
parently  much  less  critical  than  for  high  frequencies;  any 
cutoff  point  up  to  almost  600  cycles  may  serve  for  the  lows 
with  little  damage  to  intelligibility*  while  for  the  highs  cut¬ 
off  should  be  well  above  2500,  It  appears  now  that  the  more 
or  less  arbitrary  low-pass  cutoff  of  3000  cycles  now. rather 
widely  employed may  be  a  little  tdo  lew  for  optimum 
corrim Unication,  This  last  observation  assumes*  of  course, 
that  the  frequencies  above  nominal  cutoff  are  abruptly  and 
completely  attenuated*  It  may  very  well  be  that  intelligibility 
would  not  suffer  so  seriously  were  the  frequencies  above* 
say*  2000  cycles  subjected  to  the  relatively  gentle  treatment 
6f  the  typical  R-G  network*  i.e*i  3  to  6  db  attenuation  per 
octave.  ,  ■. 
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